Scaling Laws5 pages tagged "Scaling Laws"What are large language models?What are scaling laws?Can we get AGI by scaling up architectures similar to current ones, or are we missing key insights?What is compute?What is the "Bitter Lesson"?